Optimal game theoretic solution of the pursuit‐evasion intercept problem using on‐policy reinforcement learning

نویسندگان

چکیده

Abstract This article presents a rigorous formulation for the pursuit‐evasion (PE) game when velocity constraints are imposed on agents of or players. The is formulated as an infinite‐horizon problem using non‐quadratic functional, then sufficient conditions derived to prove capture in finite‐time. A novel tracking Hamilton–Jacobi–Isaacs (HJI) equation associated with value function employed, which solved Nash equilibrium policies each agent arbitrary nonlinear dynamics. In contrast existing remedies proof PE game, proposed method does not assume players moving their maximum velocities and considers priori. Attaining optimal actions requires solution HJI equations online real‐time. We overcome this by presenting on‐policy iteration integral reinforcement learning (IRL) technique. persistence excitation IRL work satisfied inherently until occurs, at time ends. Furthermore, backstepping control track desired trajectories generalized Newtonian Simulation results provided show validity methods.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Adaptive Learning Game for Autistic Children using Reinforcement Learning and Fuzzy Logic

This paper, presents an adapted serious game for rating social ability in children with autism spectrum disorder (ASD). The required measurements are obtained by challenges of the proposed serious game. The proposed serious game uses reinforcement learning concepts for being adaptive. It is based on fuzzy logic to evaluate the social ability level of the children with ASD. The game adapts itsel...

متن کامل

A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning

To achieve general intelligence, agents must learn how to interact with others in a shared environment: this is the challenge of multiagent reinforcement learning (MARL). The simplest form is independent reinforcement learning (InRL), where each agent treats its experience as part of its (non-stationary) environment. In this paper, we first observe that policies learned using InRL can overfit t...

متن کامل

Reinforcement Learning in Distributed Domains: An Inverse Game Theoretic Approach

We consider the design of multi-agent systems (MAS) so as to optimize an overall world utility function when each agent in the system runs a Reinforcement Learning (RL) algorithm based on own its private utility function. Traditional game theory deals with the "forward problem" of determining the state of a MAS that will ensue from a specified set of private utilities of the individual agents. ...

متن کامل

the relationship between using language learning strategies, learners’ optimism, educational status, duration of learning and demotivation

with the growth of more humanistic approaches towards teaching foreign languages, more emphasis has been put on learners’ feelings, emotions and individual differences. one of the issues in teaching and learning english as a foreign language is demotivation. the purpose of this study was to investigate the relationship between the components of language learning strategies, optimism, duration o...

15 صفحه اول

A Cryptographic Solution to a Game Theoretic Problem

Although Game Theory and Cryptography seem to have some similar scenarios in common, it is very rare to find instances where tools from one area are applied in the other. In this work we use cryptography to solve a game theoretic problem. The problem that we discuss arises naturally in the game theory area of two-party strategic games. In these games there are two players. Each player decides o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of Robust and Nonlinear Control

سال: 2021

ISSN: ['1049-8923', '1099-1239']

DOI: https://doi.org/10.1002/rnc.5719